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AN INTRODUCTION TO QUANTUM COMPUTING 


NOSON S. YANOFSKY 


Abstract. Quantum Computing is a new and exciting field at the intersec¬ 
tion of mathematics, computer science and physics. It concerns a utilization of 
quantum mechanics to improve the efficiency of computation. Here we present 
a gentle introduction to some of the ideas in quantum computing. The paper 
begins by motivating the central ideas of quantum mechanics and quantum 
computation with simple toy models. From there we move on to a formal 
presentation of the small fraction of (finite dimensional) quantum mechanics 
that we will need for basic quantum computation. Central notions of quan¬ 
tum architecture (qubits and quantum gates) are described. The paper ends 
with a presentation of one of the simplest quantum algorithms: Deutsch’s algo¬ 
rithm. Our presentation demands neither advanced mathematics nor advanced 
physics. 
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1. Intuition 

Quantum Computing is a fascinating new field at the intersection of computer 
science, mathematics and physics. This field studies how to harness some of the 
strange aspects of quantum physics for use in computer science. Many of the texts 
to this field require knowledge of a large corpus of advanced mathematics or physics. 
We try to remedy this situation by presenting the basic ideas of quantum computing 
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understandable to anyone who has had a course in pre-calculus or discrete struc¬ 
tures. (A good course in linear algebra would help, but, the reader is reminded of 
many definitions in the footnotes.) 

The reason why we are able to ignore the higher mathematics and physics is that 
we do not aim to teach the reader all of quantum mechanics and all of quantum 
computing. Rather, we lower our aim to simply present that part necessary to 
offer a taste of what quantum computing is all about. What makes this possible 
is that we only need finite dimensional quantum mechanics, i.e., the vector spaces 
that represent the states of the system will only be of finite dimension. Such vector 
spaces consist of finite vectors with complex entries. These vectors will change by 
being multiplied by operators or matrices. These matrices will be finite and have 
complex entries. We do not do any more quantum computing than what is needed 
for our final goal: Deutsch’s algorithm. We stress that the reader does not need 
more than the ability to do matrix multiplication in order to understand this paper. 

To motivate our use of vectors to describe states and matrices as ways of describ¬ 
ing dynamics, we show that it is understandable if one looks at a basic toy models. 
Our models deal with childrens’ marbles moving along the edges of a graph. Every 
computer scientist and logician who has taken a class in discrete structures knows 
how to represent a (weighted) graph as an adjacency matrix. We shall take this 
basic idea and generalize it in several straightforward ways. While doing this, we 
shall present many concepts that are at the very core of quantum mechanics. 

We begin with graphs that are without weights and progress to graphs that are 
weighted with real numbers, and finally to graphs that are weighted with complex 
numbers. With this in hand, we present a graph theoretic version of the double-slit 
experiment. This is the most important experiment in quantum mechanics. We 
conclude with a discussion of ways of combining systems to yield larger systems. 

Throughout this chapter, we shall present an idea in a toy model, then generalize 
it to an abstract point and lastly discuss the connection with quantum mechanics 
before moving on to the next idea. 

This paper is based on a forthcoming text Quantum Computing for Computer 
Scientists coauthored with Mirco Mannucci. The text was accepted for publication 
by Cambridge University Press and should see the light of day in the beginning 
of 2008. In the text we take the reader through the same material and go much 
further. The reader who appreciates this paper, will definitely gain from the text. 

Acknowledgement. I am grateful to Dr. Mirco Mannucci for many helpful dis¬ 
cussions and cheery editing sessions. 


1.1. Classical Deterministic Systems. We begin with a simple system described 
by a graph along with some children’s marbles. Imagine the marbles as being on 
the vertices of the graph. The state of a system is described by how many marbles 
are on each vertex. For example, say that there are six vertices in the graph and a 
total of 27 marbles. 
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We might place six marbles on vertex 0, two marbles on vertex 1 and the rest 
as described by this picture. 


0* 



1 • 



•2 


1 


3 • 



4* 



•5 



We shall denote this state as A' = [6, 2,1,5,3,10] T . The states of such a system 
will simply be a collum vector of size 6. 

We are not only interested in states of the system, but also in the way that the 
states change — or the “dynamics” of the system. This can be represented by 
a graph with directed edges. The dynamics might be described by the following 
directed graph: 



The idea is that if there exists an arrow from vertex i to vertex j, then in one time 
click, all the marbles on vertex i will move to vertex j. We place the following 
restriction on the types of graphs we shall be concerned with: graphs with exactly 
one outgoing edge from each vertex. This will correspond to the notion of a classical 
determinstic system. At each time click the marbles will have exactly one place to 


go. 

This graph is equivalent to the matrix, M (for “marbles”): 


M 


0 

1 

2 

3 

4 

5 


0 1 
0 0 
0 0 
0 1 
0 0 
0 0 
1 0 


2 3 4 5 
0 0 0 0 
0 0 0 0 
0 0 0 1 
0 10 0 
10 0 0 
0 0 10 


where M[i,j] = 1 if and only if there is an arrow from vertex j to vertex iQ Our 
restricted class of graphs are related to the restricted class of boolean matrices that 
have exactly one 1 in each column. 


1 Although most texts might have M[i,j] = 1 if and only if there is an arrow from vertex i to 
vertex j, we shall need it to be the other way for reasons which will become apparent later. The 
difference is trivial. 
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Let us say we multiply M by a state of the system X = [6,2,1, 5,3,10] T . Then 
we have 


o 

0 

0 

0 

0 

o 


CO 


o 

0 

0 

0 

0 

0 

0 


2 


0 

0 

1 

0 

0 

0 

1 


1 


12 

0 

0 

0 

1 

0 

0 


5 


5 

0 

0 

1 

0 

0 

0 


3 


1 

1 

0 

0 

0 

1 

0 


10 


9 


What does this correspond to? If X describes the state of the system at time t, 
then Y is the state at time t + 1 , i.e., after one time click. We can see this clearly 
by looking at the formula for matrix multiplication: 

5 

Y[i\ = (. M * X) [i] = 4 x \ k \ ■ 

k =0 

In plain English, this states that the number of marbles that will reach vertex i after 
one time step is the sum of all the marbles that are on vertices with edges connecting 
to vertex i. Notice that the top two entries of Y are zero. This corresponds to the 
fact that there are no arrows going to vertex 0 or 1. 

In general if X = [xq, x\, ..., x n -\] T is a column vector corresponding to having 
x,j marbles on vertex i, M is a n by n Boolean matrix, and if MX = Y where 
Y = [yoi Uii ■ ■ • > Un-i] T , then there are yj marbles on vertex j in one time click. M 
is thus a way of describing how the state of the marbles can change from time t, to 
time t+ 1. 

As we shall soon see, (finite dimensional) quantum mechanics works in the same 
way. States of a system are represented by column vectors and the way in which 
the system changes in one time click is represented by matrices. Multiplying a 
matrix with a column vector yields a new state of the system. Quantum mechanics 
explores the way states of similar systems evolve over time. 


Returning to our marbles, let’s multiply M by itself. MM = M 2 . However, since 
our entries are Boolean, we shall multiply the matrices as Boolean, i.e., 1 + 1 = 
1 V 1 = 1. 
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0 

0 

0 

1 

0 
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Looking at the formula for Boolean matrix multiplication 

n— 1 

M 2 [i,j] = V M[i,k\AM[k,j] 

fc =0 

we can see that this formula really shows us how to go from vertex j to vertex i in 
two time clicks. 

And so we have that 

M 2 [i,j] = 1 if and only if there is a path of length 2 from vertex j to vertex i. 

For an arbitrary k we have 

M k [i,j] = 1 if and only if there is a path of length k from vertex j to vertex i. 

In general, multiplying an n by n matrix by itself several times will corre¬ 
spond to the whether there is a path after several time clicks. Consider X = 
[xq,Xi, ... ,x n -\] T to be the state where one has Xq marbles on vertex 0, X\ mar¬ 
bles on vertex 1, ..., x n -i marbles on vertex n — 1. Then, after k steps, the state 
of the marbles is Y where Y = [yo,Ui, ■ ■ ■ ,Vn- i] T = M k X. In other words, yj is 
the number of marbles on vertex j after k steps. 

In quantum mechanics, if there are two or more matrices or operators that 
manipulate states, then the action of one followed by another is described by their 
matrix product. We shall take different states of systems and multiply the states 
by various matrices (of the appropriate type) to obtain other states. These other 
states will again be multiplied by other matrices until we attain the desired state. 

1.2. Classical Probabilistic Systems. In quantum mechanics, neither the state 
of a system nor the dynamics of a system are deterministic. There is an indetermi¬ 
nacy in our knowledge of a state. Furthermore, the states change with probabilistic 
laws as opposed to deterministic laws. That means that states do not change in set 
ways. Rather, the laws are given by stating that states will change from one state 
to another state with a certain likelihood. 

In order to capture these probabilistic scenarios, let us generalize what we did in 
the last subsection. Instead of dealing with a bunch of marbles moving around, we 
shall deal with a single marble. The state of the system will tell us the probabilities 
of the single marble being on each vertex. For a three-vertex graph, a typical state 
might look like this X = [jp ^} T ■ This will correspond to the fact that there 
is a one-fiftl^] chance that the marble is on vertex 0; a three-tenth chance that the 
marble is on vertex 1; and a half chance that the marble is on vertex 2. Since the 
marble must be somewhere on the graph, the sum of the probabilities is 1. 

We must generalize the dynamics as well. Rather than exactly one arrow leaving 
each vertex, we will have several arrows leaving each vertex with non-negative real 
numbers between 0 and 1 as weights. These weights will describe the probability 
of the single marble going from one vertex to another in one time click. We shall 
restrict our attention to weighted graphs that satisfy the following two conditions: 
a) the sum of all the weights leaving a vertex is 1 and b) the sum of all the weights 
entering a vertex is 1. This will correspond to the fact that a marble must go 
someplace (there might be loops) and a marble must come from someplace. An 
example of such a graph is 


2 Although the theory works with any r S [0,1], we shall deal only with fractions. 
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The adjacency matrices for our graphs will have non-negative real entries where the 
sums of the rows and the sums of the columns are all 1. Such matrices are called 
“doubly stochastic matrices.” 

Let us see how the states interact with the dynamics. Suppose we have a state 

fi 1 2i T 

X = g, . We will calculate how a state changes: MX = Y 


0 0 
1 1 
3 2 

2 1 
L 3 3 



r 211 

30 

9 

30 

0 

L 30 J 


Notice that the sum of the entries of Y is 1. If we have X expressing the probability 
of the position of a marble, and M expressing the probability of the way the marble 

9 

30 

the probability of the marble’s location after moving. In other words, if X is the 
probability of the marble at time t, then MX is the probability of the marble at 
time t+1. 

If M is an n by n doubly stochastic matrix and X is an n by 1 column vec¬ 
tor whose entries sum to 1, then M k X = Y can be interpreted as expressing 
the probability of the position of a marble after k time clicks. That is, if X = 
[xq,Xi, ... ,x„_i] T means that there is an x t chance that a marble is on vertex i, 
then M k X = Y = [yo,yi, ■ ■ ■ ,y n - i] T means that after k time clicks, there is a yj 
chance that the marble is on vertex j. 

We are not constrained to multiply M by itself. We may also multiply M by 
another doubly stochastic matrix. Let M and N be two n by n doubly stochastic 
matrices. M * N will then describe a probability transition of going from time t to 
t T 1 to t T 2. 

In quantum computers, quantum systems are generally in a probabilistic state. 
Manipulating the system will correspond to multiplying the state by matrices. Each 
time click will correspond to one matrix multiplication. At the end of the compu¬ 
tation, the resulting vector will describe the state of the system. 


moves around, then MX = Y = 


is to be interpreted as expressing 
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Before moving on to the next section, we shall examine an interesting example. 
This shall be known as the “probabilistic double slit experiment.” Consider the 
following picture of a shooting gun. 







Q> 


CD 

cd 



Figure 1. Double slit experiment with bullets. 


There are two slits in the wall. The shooter is a good enough shot to always get 
the bullets through one of the two slits. There is a 50-50 chance of which slit the 
bullet will go through. Once a bullet is through a slit, there are three targets to the 
right of each slit that the bullet can hit with equal probability. The middle target 
can get hit in one of two ways: from the top slit going down, or from the bottom 
slit going up. It is assumed that it takes the bullet one time click to go from the 
gun to the wall and one time click to go from the wall to the targets. The picture 
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correspond to the following weighted graph. 



Notice that the vertex marked 5 can receive bullets from either of the two slits. 
Corresponding to this graph is the matrix B (for “bullets”) 


B = 


0 0 000000 
^0 000000 
^0 000000 
0^010000 
0^001000 
0^^00100 
00^00010 
0 o |ooooi 


In words, B describes the way a bullet will move after one 


time click {^] 


^ The matrix B is not a doubly stochastic matrix. The sum of the weights entering vertex 0 
is not 1. The sum of weights leaving vertices 3, 4, 5, 6, and 7 are more than 1. This fact should 
not bother you. We are interested in demonstrating the way probabilities behave with respect to 
these matrices. 
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Let us calculate the probabilities for the bullet’s position after two time clicks. 


B-kB = B 2 


0 0 0 0 0 0 
0 0 0 0 0 0 
0 0 0 0 0 0 

n • 1 •» 
1 1 0 0 1 0 

3 3 3 0 0 1 

1 0 1 0 0 0 

1 0 1 0 0 0 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
0 1 


So B 2 indicates the probabilities of the bullet’s position after two time clicks. 

If we are sure that we start with the bullet in position 0, i.e., X = [1,0,0, 0,0,0,0, 0] 
then, after two time clicks, the state of the bullets will be 


B 2 X = [0,0,0, 


11111 
6 ’ 6 ’ 3 ’ 6 ’ 6 


T 


The key idea is to notice that R 2 [5,0] = g + ^ = jj because the bullets start from 
position 0, then there are two possible ways of the bullet getting to position 5. The 
possibilities sum to This is what we would expect. We shall revisit this example 
in the next subsection where strange things start happening! 


1.3. Quantum Systems. We are now ready to leave the world of classical proba¬ 
bilities and enter the world of the quantum. One of the central facts about quantum 
mechanics is that complex numbers play a major role in the calculations. Proba¬ 
bilities of states and transitions are not given as a real numbers p between 0 and 
1. Rather, they are given is a complex numbers c such that |c| 2 is a real numbei]^] 
between 0 and 1. 

What is the difference how the probabilities are given? What does it matter if 
a probability is given as a real number between 0 and 1, or as a complex number 
whose modulus squared is a real number between 0 and 1? The difference is — and 
this is the core of quantum theory — that real number probabilities can be added 
to obtain larger real numbers. In contrast, complex numbers can cancel each other 
and lower their probability. In detail, if p\ and P 2 are two real numbers between 
0 and 1, then (pi + p 2 ) > pi and (pi + p 2 ) > p 2 . Now let’s look at the complex 
case. Let ci and c 2 be two complex numbers with their squares of modulus |ci| 2 
and | C 2 1 2 - |ci + C 2 I 2 need not be bigger then |ci| 2 and it also does not need to be 
bigger than |c 2 1 2 . 

For exampl^j , if ci = 5 + 3* and c 2 = — 3 — 2 i, then |ci| 2 = 34 and |c 2 1 2 = 13 
but |ci + c 2 | 2 = |2 + i\ 2 = 5. 5 is less than 34 and 5 is less than 13. 


'We remind the reader that if c = a+bi is a complex number, then its modulus is |c| = \/a- f h 1 
and |c| 2 = a 2 + b 2 . 

®The important point here is that the modulus squared is positive. For simplicity of calcula¬ 
tions, we have chosen easy complex numbers. 
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This possibility of canceling out complex numbers corresponds to something 
called “interference” in quantum mechanics. One complex number might interfere 
with another. It is one of the most important ideas in quantum theory. 

Let us generalize our states and graphs from the previous subsection. Rather 
than insisting that the sum of the entries in the column vector is 1, we insist that 
the sum of the modulus squared of the entries is 1. This makes sense since we are 
considering the probability as the modulus squared. 

For dynamics, rather than talking about graphs with real number weights, we 
shall talk about graphs with complex number weights. Instead of insisting that the 
adjacency matrix of such a graph be a doubly stochastic matrix, we ask instead 
that the adjacency matrix be unitary]^] 

For example, consider the graph 



1 


V2 



The corresponding unitary adjacency matrix is 


U 


_±_ _±_ 

y/2 V2 

—i i 

V2 V2 

0 0 


0 

0 

i 


Unitary matrices are related to doubly stochastic matrices as follows. The mod¬ 
ulus squared of the all the complex entries in U forms a doubly stochastic matrix. 
The *,jth element in U is denoted U[i,j], and its modulus squared is denoted 
|C/[*,j]| 2 . By abuse of notation, we shall denote the entire matrix of modulus 
squares as \U[i, j]\ 2 : 


\U[i,j}\ 2 


1 

2 

1 

2 

0 


1 

2 

1 

2 

0 


0 

0 

1 


It is easy to see that this is a doubly stochastic matrix. 


®Let us just remember: a matrix U is unitary if U * C/'t = / = C/t * U. The adjoint of U, 
denoted as C/t, is defined as E/t = (U) T = ( U T ) or U^\j, k] = U[k,j\. 
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From this graph-theoretic point of view, it is easy to see what unitary means: 
the conjugate transpose of the U matrix is 


t/t = 


V2 

V2 

0 


i 

V2 

—i 

V2 

0 


-k= - 4 = 0 


-4= 0 


This matrix corresponds to the graph 


1 

V2 




—i 



V2 



If U is the matrix that takes a state from time t to time t + 1, then U' is the matrix 
that takes a state from time t to time t — 1. If we multiply U and U\ we get the 
identity matrix I 3 which corresponds to the graph 



1 


This means that if we perform some operation and then “undo” the operation, we 
will find ourselves in the same state as we began with probability 1. It is important 
to note that unitary does not only mean invertible. It means invertible in a very 
easy way, i.e., the inverse is the dagger of the matrix. This “invertibility” is again 
an important issue in quantum mechanics. Most of the dynamics will be invertible 
(except measurements). 



In order to see the interference phenomenon and the related “superposition” 
phenomenon, we will revisit the double-slit experiment from the last subsection. 
Rather than looking at bullets, which are relatively large objects and hence adhere 
to the laws of classical physics, we shall look at microscopic objects such as photons 
that follow the laws of quantum physics. Rather than having a gun, we shall have 
a laser shoot photons through two slits as in figure 2. 

Again we shall make the assumption that a photon will pass through one of the 
two slits. Each slit has a 50% chance of the photon going through it. To the left 
of each slit there are three measuring devices. It is assumed that it takes one time 
click to go from the laser to the slits and one time click to go from the slits to the 
targets. We are not interested in how large the slits are or how far the measuring 
positions are from the slits. Physicists are very adapt at calculating many different 
aspects of this experiment. We are only interested in the set-up. 
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Figure 2. Double slit experiment with photons. 

Consider the following weighted graph that describes the set-up of the experi¬ 
ment: 
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The modulus squared of is which corresponds to the fact that there is a 50-50 


V2 


chance of the photon going through either slit. 


±1 ±i 


V6 


= 4 which corresponds 


to the fact that whichever slit the photon goes through, there is a ^ of a chance of 
its hitting any of the three measuring positions to the right of that slit. Q 
The adjacency matrix for this graph is P (for “photons”)^] 


P 


0 

1 

V2 

V2 

0 

0 

0 

0 


0 

0 

0 


-1 + i 

V 6 

—1 — i 

V 6 

1 — i 

V6 

0 


0 0 


0 0 0 0 0 0 
0 0 0 0 0 0 

0 0 0 0 0 0 


0 1 0 0 0 0 


0 0 1 0 0 0 


-1 + i 


v/6 

— 1 — i 

V6 

1 — i 

V6 


0 0 10 0 
0 0 0 1 0 
0 0 0 0 1 


The modulus squared of the P matrix is exactly the same as the bullets matrix 
i.e., \P[i, j] | 2 = B. Let us see what happens if we calculate the transitions 


'The actual complex number weights are not our interest here. If we wanted to calculate the 
actual numbers, we would have to measure the width of the slits, the distance between the slits, 
the distance from the slits to the measuring devices etc. However, our goal here is to clearly 
demonstrate the interference phenomenon. And so we chose the above complex numbers simply 
because the modulus squared are exactly the same as the bullets case. 

H 'I’li is matrix is not a unitary matrix. Looking carefully at row 0, one can immediately see that 
P is not unitary. In our graph, there is nothing entering vertex 0. The reason why this matrix 
fails to be unitary is because we have not put in all the arrows in our graph. There are many more 
possible ways the photon can travel in a real-life physical situation. In particular, the photon 
might go from the right to the left. The diagram and matrix would become too complicated if we 
put in all the transitions. We are simply trying to demonstrate the interference phenomenon and 
we can accomplish that even with a matrix that is not quite unitary. 
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matrix after two time clicks. 









0 

0 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 


-1 + i 
VV2 

-1 + i 

V6 

0 

1 

0 

0 

0 

0 

P 2 = 

—1 — i 

Vl2 

— 1 — i 

V6 

0 

0 

1 

0 

0 

0 


0 

-1 + i 

V6 

1 — i 

V6 

0 

0 

1 

0 

0 


—1 — i 

Vl2 

0 

—1 — i 

V6 

0 

0 

0 

1 

0 


-1 + i 

Vu 

0 

1 — i 

V6 

0 

0 

0 

0 

1 


How do we interpret this in terms of probability? Let us look at the modulus 
squared of each of the entries. 


\P%j]\ 2 


0 0 000000 

0 0 000000 

0 0 000000 

jl^OlOOOO 
^^001000 
o||ooioo 
jlo^oooio 
jlo^ooooi 


This matrix is almost exactly the same as B 2 but with one glaring difference. 
S 2 [5,0] = because of the two ways of starting at position 0 and ending at position 
5. We added the nonnegative probabilities g ^ = g- However with a photon 
that follows the laws of quantum mechanics, the complex numbers are added as 
opposed to their probabilities. 




1 1 1 


— 1 + i 1 — i 0 

— 1 — 

vV 

{ V6 ) 

l + y2' 

l Ve) 

! yi2 Vu Vl2 


Thus giving us |P 2 [5, 0]| 2 = 0. In other words, although there are two ways of a 
photon going from vertex 0 to vertex 5, there will be no photon at vertex 5. 

How is one to understand this phenomenon? Physicists have a simple explanation 
for interference: waves. A familiar observation such as a pebble thrown into a 
pool of water will easily convince us that waves interfere, sometimes reinforcing 
each other, sometimes cancelling each other. In our experiment, photons are going 
through both slits at one time and they are canceling each other out at the middle 
measuring device. Thus, the double-slit experiment points to the wave-like nature 
of light. 
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The experiment can be done with only one photon shot out from vertex 0. This 
ensures that there will not be another wave for it to cancel out with. And yet, when 
only one photon goes through a slit, there is still an interference phenomenon. What 
is going on here? 

The naive probabilistic interpretation of the position of the photon following the 
bullet metaphor of the previous subsection is thus not entirely adequate. Let the 
state of the system be given by X = [co, Ci,..., c„_i] T eC". It is incorrect to say 
that the probability of the photon being in position k is |cfc| 2 . Rather, a system in 
state X means that the particle is in all positions simultaneously. It is only after 
we measure the photon that the chances of it being found in position k is | | 2 . 

The photon (or its associated wave) goes through the top slit and the bottom slit 
simultaneously. And when the photon exits both slits, it can cancel itself out. A 
photon is not in a singe position, rather it is in many positions or a “superposition”. 

This might cause some justifiable disbelief. After all, we do not see things in 
a superposition of states. Our everyday experience tells us that things are in one 
position or (exclusive or!) another position. How can this be? The reason why we 
see particles in exactly one position is because we have performed a measurement. 
When we measure something at the quantum level, the quantum object that we 
have measured is no longer in a superposition of states, rather it collapses to a 
single classical state. So we have to redefine what a state of a quantum system 
means: A system in state X means that after measuring the photon it will be in 
position k with probability |cfc| 2 . 

What are we to make of these strange ideas? Are we really to believe them? 
Richard Feynman in discussing the double-slit experiment [?] (Vol. Ill, Page 1-1) 
waxes lyrical: 


We choose to examine a phenomenon which is impossible, absolutely 
impossible, to explain in any classical way, and which has in it 
the heart of quantum mechanics. In reality, it contains the only 
mystery. We can not make the mystery go away by “explaining” 
how it works. We will just tell you how it works. 

It is exactly this superposition of states that is the real power behind quantum 
computing. Classical computers are in one state at every moment. Quantum 
computers can be put in a superposition of states. Imagine putting a computer in 
many different classical states at one time and then processing with all the states. 
This is the ultimate in parallel processing! Such a computer can only be conceived 
in the quantum world. 


1.4. Combining Systems. Quantum mechanics can also deal with systems that 
have more than one part. In this subsection we learn how to combine several 
systems into one. We shall talk about combining classical probabilistic systems. 
However, whatever is stated about classical probabilistic systems is also true for 
quantum systems. 



16 


NOSON S. YANOFSKY 


Consider two different marbles. Imagine that a red marble follows the probabil¬ 
ities of the graph whose corresponding adjacency matrix is 

n 1 5 1 

U 6 5 

111 
3 2 5 ' 

2 1 n 

3 3 u J 

Consider also a blue marble that follows the transitions given by the graph 




i.e., the matrix 


N 




What does a state for a two marble system look like? Since the red marble can 
be on one of three vertices and the blue marble can be on one of two vertices, there 
are 3x2 = 6 possible states of the combined system. This is the tensor product of 
a 3 by 1 vector with a 2 by 1 vector. A typical state might look like: 


1 

13 
0 
2 

13 
1 
3 
0 
1 
2 

which would correspond to the fact that there is a 

yg chance of the red marble being on vertex 1 and the blue marble 
being on vertex a , 

0 chance of the red marble being on vertex 1 and the blue marble 
being on vertex 6, 


2 chance of the red marble being on vertex 3 and the blue marble 
being on vertex 6. 

Now we may ask, how does a system with these two marbles change? What 
is its dynamics? Imagine that the red marble is on vertex 1 and the blue marble 
is on vertex a. We may write this as “la.” What is the probability of the state 
going from state la to state 26? Obviously, the red marble must go from vertex 
1 to vertex 2 and (multiply) the blue marble must go from vertex a to vertex b. 
The probability is g x g = g. In general, for a system to go from state ij to a 


Oa 

06 

la 

X = 

16 
2 a 
26 
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state i'j' we must multiply the probability of going from state i to state i' with the 
probability of going from state j to state f. 

M[i' ,i]xN[j' ,j] 

ij - i 3 ■ 


For the changes of all the states, we have to do this for all the entries. We are really 
giving the tensor product of two matrice^j 


M®N 


0 


1 2 


- 


1 

2 ' 


i 

2 


1 

2 

- 

0 


3 

3 

i 

3 

3 

5 

3 

3 



2 

1 

6 

2 

1 

B 

2 

1 




3 

3 J 


L 3 

3 J 


L 3 

3 . 



~ 

1 

2 ' 


1 

2 


1 

2 


1 


3 

3 

1 

3 

3 

1 

3 

3 


3 


2 

1 

2 

2 

1 

5 

2 

1 



- 

3 

3 . 


L 3 

3 J 


L 3 

3 . 



" 

1 

2 ' 


1 

2 


1 

2 


2 


3 

3 

1 

3 

3 

0 

3 

3 


3 


2 

1 

3 

2 

1 

2 

1 


- 

- 

3 

3 . 


L 3 

3 J 


. 3 

3 J 

- 



0a 

06 

la 

16 

2 a 

26 



Oa 


' 0 

0 

1 

13 

2 

IB 

5 

IB 

10- 

18 



06 


0 

0 

2 

IB 

1 

IB 

10 

IB 

5 

18 



la 


1 

2 

1 

2 

1 

2 




g 

g 

B 

B 

IB 

IB 



16 


2 

i 

2 

1 

2 

1 




g 

g 

B 

B 

IB 

IB 



2 a 


2 

0 

4 

g 

1 

9 

2 

B 

0 

0 



2 b 


4 

L 9 

2 

g 

2 

g 

1 

g 

0 

0 




The graph that corresponds to this matrix, Gm x Gn is called the Cartesian prod¬ 
uct of two weighted graphs. In quantum theory, the states of two systems are 
combined using the tensor product of two vectors and the dynamics of two systems 
are combined by using the tensor product of two matrices. The tensor product of 
the matrices will then act on the tensor product of the vectors. 

In general, the Cartesian product of an ?r-vertex graph with an n'-vertex graph 
is an (n x n^-vertex graph. If we have an n-vertex graph G and we are interested 
in m different marbles moving in this system, we would need to look at the graph 


G m = G x G x • • • x G 

X_ v _X 

m times 


^Formally, the tensor product of matrices is a function 


<g> : c mXm x C nXn 


• <S) C" 


_ ^ymnXmn 


and it is defined as 


(A <g> B)[j, k] = A[j /n, k/m] x B[j MOD n,k MOD m]. 
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which will have n m vertices. If Mg is the associated adjacency matrix, then we will 
be interested in 

M^ m = Mg <g> M g <g> • • • <g> M G 

" -V-' 

m times 

which will be a n m by n m matrix. 

One might think of a bit as a two vertex graph with a marble on the 0 vertex or 
a marble on the 1 vertex. If one were then interested in m bits, one would need a 
2 m vertex graph or equivalently a 2 m by 2 m matrix. So there is exponential growth 
of the resources needed for the amount of bits discussed. 

This exponential growth for a system is actually one of the main reasons why 
Richard Feynman started thinking about quantum computing. He realized that 
because of this exponential growth, it would be hard for a classical computer to 
simulate such a system. He asked whether a quantum computer with its ability to 
do massive parallel processing, might be able to simulate such a system. 

2. Basic Quantum Theory 

Armed with the intuition, we tackle the formal statement of quantum theory. 
A disclaimer is in order: We are only presenting a small part of finite dimensional 
quantum physics. There is no way that we can give more than a minute fraction of 
this magnificent subject in these few pages. It is a sincere hope that this will inspire 
the reader to go on and study more. For a mathematician, the best book to start 
reading about quantum mechanics is, of course, Dirac’s classic text [?]. However, 
there are many other primers available, e.g., [?, ?, ?] and [?]. The more advanced 
mathematician might want to look at [?]. 

2.1. States. An n dimensional quantum system is a system that can be observed 
in one of n possible states. Examples of such systems are 

• a particle can be in one of n positions; 

• a system might have one of n energy levels; 

• a photon might have one of n polarization directions. 

For clarity, lets talk of the first example. Lets say we have a particle that can be 
in one of n positions. 

The states of such a system shall be represented by collum vectors of n complex 
numbers. We shall denote these vectors with the “ket” | ) p^| notation: 

| <p} = [co, Cl, . . . , Cj , . . . , C n — i] . 

How is one to interpret these kets? Let us look at simple cases. The state 

IV') = [0,1,... ,0,... ,0]^ 

is to be thought of as saying that our particle will be found in position 1. The state 

|V/) = [0,0,...,1,...,0] T 

is to be interpreted that the particle is in position j. These two states are examples 
of what are called “pure states”. 

How is one to interpret an arbitrary 

W) = [c 0 ,Ci,...,Cj,...,C n _i] T ? 

is the second half of “bracket”. However we shall not use the “bra” part in our 

exposition. 
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Let S be the sum of the squares of modulus of the Cj, i.e., 

S = Ic 0 | 2 + Ici| 2 + ... + |c„_i| 2 . 

This is the length of the vector \<p). Then \tp) is to be interpreted that if one 
was to measure the state described by \<p) we would find the particle in position 0 
with probability | Co | 2 /-S', in position 1 with probability |ci| 2 /S, in position 2 with 
probability \c 2 \ 2 /S, ..in position n—1 with probability |c„_i| 2 /S' Such states are 
called “superpositions”. They say that the particle is in more than one “position” 
at a time. It is important to stress that \<p) means that the particle is in all positions 
simultaneously. It does not mean that the particle is in some single position and 
the Cj are giving us probabilities of which position. 


These superpositions can be added: if 


W) = [c 0 ,Ci,... 

1 Cj •> • • • 5 Cn— l] 

and 


W) = [co,ci,... 

c' c' \ T 

then 


1^) + W) = [co + Co, Cl + ci,. 

. . ,Cj + c'j,. . ., c n -1 + cUf- 

Also, if there is a complex number c € C, 

we can multiply a ket by this 

c|y>) = [c X C 0 ,C X Cl, . . . 

. ,C X Cj, . . . ,C X c n -i] T 


These operation satisfy all the properties of being a complex vector space. So the 
states of an n dimensional quantum system are represented by the complex vector 
space C n . 

Let us add a superposition to itself. 

, Cj + Cj 


,..., c n —i T c n _i] 
■ ,2c n _i] T . 


W) + W) = 2 I^) = [co + c 0 , Cl + Cl, . 

[2co, 2ci, • ■., 2Cj,. 

The sum of the modulus squared is 

S' = |2co|"+|2ci|“+.. .+|2c n _i| 2 = 2 2 |co| 2 +2 2 |ci| 2 +. . .+2 2 |c ra _i| 2 = 2 2 (| cq | 2 - 


Cl 


The chances of a particle being found in position j is 

12c■ 1 2 2 2 1 c ■ 1 2 I c-1 2 

= 2 2 (| c 0 1 2 + |ci | 2 + ... + |c n _i| 2 ) = |c 0 | 2 +|ci| 2 + ...+ |c n _ir 

In other words, the ket 2| <p) describes the same physical system as \ip). This makes 
sense, after all when we add two of the same superpositions, we do not expect any 
interference. We expect that they should reinforce each other. A similar analysis 
shows that for an arbitrary c £ C we have that the ket | tp) and the ket c\tp) 
describe the same physical state. Geometrically, we can say that the vector \tp) 
and the extension c\<p) describe the same physical state. So the only thing that is 
important is the direction of | ip) not the length of \ip). We might as well work with 
a “normalized” | <p), i.e., 

1^) 

\m 

which has length 1. In fact, in section 1, we only worked with normalized vectors. 

Given an n dimensional quantum system represented by C n and an m dimen¬ 
sional quantum system represented byC m , we can combine these two systems to 
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form one system. This one system is to represented by the tensor product of the 
two vector spaces: 


If \<p) is in the first system and | tp') is in the second system, then we represent the 
combined system as 

W) ® W) = I= \ w')- 

It is important to realize that, in general, there are more elements in the tensor 
product of the two systems than in the union of each of the two systems. States in 
C" 0 C m that cannot be represented simply as an element in C” and an element in 
C m are said to be “entangled”. 

2.2. Dynamics. Quantum systems are not static. The states of the system are 
constantly changing. Changes, or “operators”, on an n dimensional quantum sys¬ 
tem are represented by n by n unitary matrices. Given a state | ip) that represents 
a system at time t , then the system will be in state U\<p) at time t + 1. 

What does unitary really mean? If U\(p) = \<p') then we can easily form U 1 and 
multiply both sides of the equation by XJ^ to get U^U\<p) = U^\ip') or | ip) = U^\ip'). 
In other words, because U is unitary, there is a related matrix that can “undo” the 
action that U does. W takes the result of U ’s action and gets the original vector 
back. In the quantum world, most actions actions are “undoable” or “reversible” 
in such a manner. 

If U operates on C n and U' operates on C m , then t/0t/' will operate on <C n 0<C m 
in the following way: 

(u 0 u'){\ip) 0 |+')) = u\<p) 0 u'W). 

2.3. Observables. There are other types of operations that one can do to a n 
dimensional quantum system: one can observe, or “measure”, the system. When 
we measure a system, it is no longer in a superposition. The superposition is said 
to “collapse” to a pure state. 

1^) [on Ci,..., Cj,..., c n _i] \^p ) [0,0,..., 1,..., 0]. 

The question is which of the n pure states will the state collapse to? The answer 
is that it is random. Let S be the sum of all the squares of the modulus, i.e., 

S = | c 0 | 2 + |ci|“ + • • • + \Cj\~ + ■ ■ ■ + |c„_i| 2 . 

There is a |c 0 \ 2 /S of a chance of the superposition collapsing to the 0th pure state. 
There is a |ci \ 2 /S of a chance of the superposition collapsing to the 1th pure state, 
etc. There is no known mathematical way to decide which pure state the system 
will, in-fact, collapse to. 

An observable, or “measurement” on an n dimensional system is represented 
by an n by n hermitian matrix. We remind the reader that a n by n matrix 
A is hermitian, or “self-adjoint” if JV = A. In other words, A[j, k\ = A[k,j]. 
Equivalently A is hermitian if and only if A T = A. For example, the matrix 

5 4 + 5 i 6 — 16* 

4 - 5 i 13 7 

6 + 16* 7 -2.1 


is hermitian. 
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For a matrix A in C nx ™, if there is a number c in C and a vector | ip) in C n such 
that 

A\ip) = c#> 

then c is called an “eigenvalue” of A and | ip) is called an “eigenvector” of A associ¬ 
ated to c. The eigenvalues of a hermitian matrix are all real numbers. Furthermore, 
distinct eigenvectors which have distinct eigenvalues of any hermitian matrix are 
orthogonal. The hermitian matrices that represent observables for an n dimen¬ 
sional system have the further property that there are n distinct eigenvalues and n 
distinct eigenvectors. That means that the set of eigenvectors form a basis for the 
entire complex vector space that represents the quantum system we are interested 
in. Hence if we have an observable A and | ip) an eigenvalue of A then A\tp) = c\ip) 
for some c € C. c\<p) represents the same state as \ip) as we said before. So if the 
system is in an eigenvector of the basis, then the system will not change. 

3. Architecture 

In this section we are going to show how the ideas of quantum mechanics are 
going to influence our construction of quantum computers. In this paper we do not 
dwell on actual hardware implementations. Rather we shall look at the quantum 
generalizations of bits and logical gates. In section 3.1 we go from bits to qubits. 
We also discuss the notation that is needed for this. In section 3.2 we show how 
to look at classical computing as matrix manipulations. From the view afforded by 
this perspective, we easily generalize the notion of logical gate to quantum gate. 
There are many quantum gates, but we shall only look at a few that will be needed 
in the next section. 

3.1. Bits and Qubits. What is a bit? A bit is an atom of information that 
represents one of two disjoint situations. An example of a bit is electricity going 
through a circuit or electricity not going through a circuit; a switch turned on or a 
switch turned off; a way of saying true or false. All these examples are saying the 
same thing: a bit is a way of describing a system whose set of states is of size two. 

A bit can be represented by two 2 by 1 matrices. We shall represent 0—or better 
the state |0) as 


We shall represent a 1, or state |1) as: 


0 

1 


1 

0 


Since these are two different representations, we have an honest-to-goodness bit. 

A bit is either in state |0) or in state |1). That was sufficient for the classical 
world. But that is not sufficient for the quantum world. In that world we have 
situations where we are in one state and in the other state simultaneously. In the 
quantum world we have systems where a switch is in a superposition of states on 
and off. So we define a “quantum bit” or a “qubit” as a way of describing a 
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quantum system of dimension two. We shall represent any such qubit as a two by 
one matrix with complex numbers: 


0 

1 


co 

Cl 


where |co| 2 + |ci| 2 = 1. Notice that a classical bit is a special type of qubit. |co| 2 is 
to be interpreted as the probability that after measuring the qubit, it will be found 
in state |0). |ci| 2 is to be interpreted as the probability that after measuring the 
qubit it will be found in state |1). Whenever we measure a qubit, it automatically 
becomes a bit. So we shall never “see” a general qubit. Nevertheless, they do exist 
and they are the core of our tale. The power of quantum computers is due to the 
fact that a system can be in many states at the same time. 

Following the normalization procedure that we learned in the last section, any 
element of C 2 can be converted into a qubit. For example, the vector 


V = 


5 + 3i 
6 i 


has norm 


\v\ = VWx> = 


\ 


[5 - 3 i, —6 i] 


5 + 3i 
6 i 


= V34 + 36 = \/70. 


So V describes the same physical state as the qubit 

~ 5 + 3i 
_^ = x/70 

V70 _6i_ , 

L V70 J 

After measuring the qubit —^=, the probability of being in state |0) is yg, and the 
probability of being in state |1) is yg. 

It is easy to see that the bits |0) and |1) are the canonical basis of C 2 . So any 
qubit can be written as 


Co 


1 


0 

= c o |0) + ci|l) 

= Co • 


+ Cl • 


Cl 


0 


1 



Let us look at several ways of writing different qubits. -K= 

v 2 


can be written 


as 


1 

V2 

1 

V2. 
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Similarly 


72 



can be written as 


72 
-1 
V2 

It is important to realize that 

|0) + |1) _ |i) + |Q) 

72 72 

' J_ ' 

These are both ways of writing 7^ . In contrast, 

- 72 _ 

|0)-|1) y |1)-|0) 

_ 

- 72 . 

states are related: 

|0) — |1) n Ji) -10) 

72 1 j 72 ' 

How are qubits to be implemented? While this is not our focus, some examples 
of qubit implementations are given: 

• An electron might be in one of two different orbits around a nucleus of an 
atom. (Ground state and excited state.) 

• A photon might be in one of two different polarized states. 

• A subatomic particle might be in spinning in one of two different directions. 

There will be enough quantum indeterminacy and quantum superposition effects 
within all these systems to represent a qubit. 



= >uAi> = MH!>. 

72 72 72 


Computers with only one bit of storage are not very interesting. Similarly, we 
will need quantum devices with more then one qubit. Consider a byte, or eight 
bits. A typical byte might look like 

01101011 . 

We might also write it as 



Previously, we learned that in order to combine systems one should use the tensor 
product. We can describe the above byte as 

| 0 ) ® | 1 ) 0 | 1 ) 0 | 0 ) 0 | 1 ) 0 | 0 ) 0 | 1 ) 0 | 1 ). 
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As a qubit, this is an element of 

c 2 ®c 2 ®c 2 «c 2 ®c 2 ®c 2 ®c 2 ® c 2 . 

This vector space can be written as (C 2 )® 8 . This is a complex vector space of 
dimension 2 8 = 256. Since there is only one complex vector space (up to isomor¬ 
phism) of this dimension, this space is isomorphic to C 256 . 

Our byte can be described as in yet another way: As a 2 8 = 256 row vector 


00000000 0 
00000001 0 

01101010 0 
01101011 1 
01101100 0 

11111110 0 
11111111 [o 

This is fine for the classical world. However, for the quantum world, a general 
qubit can be written as 


00000000 T c 0 

00000001 ci 

01101010 cioe 
01101011 ciot 
01101100 cios 

11111110 C254 

11111111 U255 

where £?=o |c 4 | 2 = 1. 

In the classical world, you need to write the state of each of the eight bits. This 
amounts to writing eight bits. In the quantum world, a state of eight qubits is 
given by writing 256 complex numbers. This exponential growth was one of the 
reasons why researchers started thinking about quantum computing. If you wanted 
to emulate a quantum computer with a 64 qubit register, you would need to store 
2 64 = 18,446,744,073,709,551,616 complex numbers. This is beyond our current 
ability. 
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Let us practice writing two qubits in ket notation. The qubits 


00 

01 

10 

11 

can be written as 

10 } < 

Since the tensor product is understood, 
or |01). The qubit corresponding to 


0 

1 

0 

0 

|1>- 

re might also write these qubits as |0,1) 


1 

73 


1 

0 

-1 

1 


can be written as 


1 

73 


| 00 > 


73 |10> + 73 |n) 


I oo) ~ I io) +1 ii) 

73 


The tensor product of two states is not commutative. 


|0) ® |1) = |o, 1) = |01) ± 110) = |1,0) = |1) ® |0). 


The first ket says that the first qubit is in state 0 and the second qubit is in state 
1. The second ket says that first qubit is in state 1 and the second state is in state 

0. 

3.2. Classical Gates. Classical logical gates are ways of manipulating bits. Bits 
go into logical gates and bits come out. We represent n input bits as a 2" by 1 
matrix and m output bits as a 2 m by 1 matrix. How should we represent our logical 
gates? A 2 m by 2" matrix takes a 2" by 1 matrix and outputs a 2 m by 1 matrix. 
In symbols: 

(2 m x 2 n )(2 n x 1) = (2 m x 1). 

So bits will be represented by column vectors and logic gates will be represented 
by matrices. 

Let us do a simple example. Consider the NOT gate. NOT takes as input one 
bit, or a 2 by 1 matrix, and outputs one bit, or a 2 by 1 matrix. NOT of |0) equals 
|1) and NOT of |1) equals |0). The matrix 


NOT = 


0 1 
1 0 


This matrix satisfies 


0 1 


1 


1 

o 

1_ 


0 1 


1 

O 

i_ 


l 

1 0 


1 

o 

_1 


1 


1 0 


l 


i 

o 

_1 


which is exactly what we want. 
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What about the other gates? The other gates will be given by the following 
matrices: 


AND 

NAND 

OR 

NOR 


1110 

0 0 0 1 



0 0 0 1 

1110 



10 0 0 

0 111 



0 111 

10 0 0 



When we perform a computation, often we have to carry out one operation 
followed by another. 



We shall call this performing “sequential” operations. If matrix A corresponds 
to performing an operation and matrix B corresponds to performing another opera¬ 
tion, then the multiplication matrix B-kA corresponds to performing the operation 
sequentially. 

Besides sequential operations, there are “parallel” operations: 



Here we are doing A to some bits and B to other bits. This will be represented by 


A® B, the tensor product of two matrices. 

Combination of sequential and parallel operations gates/matrices will be circuits. 
A starting point is the realization that the circuit 
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can be realized as 


(B * A) (g) ( B' ~k A') = {B® B') * (A ® A'). 

We will of course have some really complicated matrices, but they will all be de¬ 
composable into the sequential and parallel composition of simple gates. 

3.3. Quantum Gates. A quantum gate is simply any unitary matrix that manip¬ 
ulates qubits. There are some simple gates that are quantum gates. 

The “Hadamard matrix” 

_±_ _±_ 

V2 V2 

AL_ _l 

y/2 y/2 . 

The Hadamard matrix is also its own inverse. As it turns out, the Hadamard matrix 
is one of the most important matrices in quantum computing. 

Consider the following “controlled-not” gate. 


H =^ 

V2 


1 

1 


1 

-1 



I y) 


-e 


10 ©y)) 


This gate has two inputs and has two outputs. The top input is the control bit. 
It controls what the output will be. If \x) = |0), then the output of | y) will be the 
same as the input. If \x) = |1) then the output of | y) will be the opposite. If we 
write the top qubit first and then the bottom qubit, then the controlled-not gate 
takes | x,y) to \x,x ® y) where ® is the binary exclusive or operation. 

The matrix that corresponds to this reversible gate is 


00 

01 

10 

11 


00 01 10 
10 0 
0 10 
0 0 0 
0 0 1 


11 

0 

0 

1 

0 


One last piece of notation: we shall denote an observation (measurement) by the 
following “meter”: 
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There are many other quantum gates, but we shall not need them for our work 
in the next section. 


4. Deutsch’s Algorithm 

The simplest quantum algorithm is Deutsch’s algorithm which is a nice short 
algorithm that solves a slightly contrived problem. This algorithm is concerned 
with functions from the set {0,1} to the set {0,1}. There are four such functions 
which we might visualize as 


0 * I - *0 0 * 1 - *0 0 * *0 0 * *0 



!• •! 1*1 - ^.1 !• •! 1*1 - ^*1 


Call a function / : {0,1} —► {0,1}, “balanced” if /(0) ^ /(1)). In contrast, call 
a function “constant” if /(0) = /(1). Of the four functions, two are balanced and 
two are constant. 

Deutsch’s algorithm solves the following problem: Given a function / : {0,1} —> 
{0,1} as a black-box, where one can evaluate an input, but cannot “look inside” 
and “see” how the function is defined, tell if the function is balanced or constant. 

With a classical computer, one would have to first evaluate / on an input, then 
evaluate f on the second input and then compare the outputs. The point is that 
with a classical computer, / must be evaluated twice. Can we do better? 

A quantum computer can be in two states at one time. We shall use this super¬ 
position of states to evaluate both inputs at one time. 


In classical computing, evaluating a given function / would correspond to per¬ 
forming the following operation 


x 


f 


.fix) 


Such a function could be thought of as a matrix. The function 


0*1 ->■ *0 



• 1 


is equivalent to the matrix 


1 

1 

0 


Multiplying either state |0) or state |1) on the right of this matrix would result in 
state |0). The column name is to be thought of as the input and the row name is 
to be thought of as the output. 
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However, this will not be good for a quantum system. For a quantum system 
we need a little something extra. A quantum system must be unitary (reversible). 
Given the output, we must be able to find the input. If / is the name of the 
function, then the following black-box Uf system will be the quantum gate that we 
shall employ to evaluate input: 



The top input, |ar), will be the qubit value that one wishes to evaluate and the 
bottom input, | y), controls the output. The top output will be the same as the 
input qubit |x) and the bottom output will be the qubit |y®/(a:)) where ® is XOR, 
the exclusive or operation. We are going to write from left to right the top qubit 
first and then the bottom. So we say that this function takes the state |x, y) to the 
state | x,y® /( x)). If y — 0 this simplifies: |x, 0) to \x, 0® f{x)) = | x,f(x)). This 
gate is reversible by simply looking at the following system 



State | x,y) goes to | x,y® f{x)) which further goes to 

\x, (:y ® f{x)) ffi /( x)) = | x,y® (f(x) ® f(x))) = \x, y ® 0) = \x, y ). 

In quantum systems, evaluating / is equivalent to multiplying a state of the 
input by a unitary matrix Uf. For the function 


0 * *0 



!• 


• 1 
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the corresponding unitary matrix is 


00 

01 

10 

11 


00 01 10 11 
0 10 0 
10 0 0 
0 0 10 
0 0 0 1 


Remember that the top column name correspond to the input | x,y) and the 
left-hand row name corresponds to the outputs \x, y © f(x)). A 1 in the xy column 
and the x'y' row means for input \x,y) the output will be | x\y'). 

So we are given such a matrix that expresses a function but we cannot “look 
inside” the matrix to “see” how it is defined. We are asked to determine if the 
function is balanced or constant. 


4.1. First Attempt. Let us take a first stab at a quantum algorithm to solve this 
problem. Rather than evaluating / twice, let us try our trick of superposition of 
states. Instead of having the top input to be either in state |0) or in state 11), we 
shall put the top input in state 

| 0 ) + | 1 ) 

V2 


which is “half-way” |0) and “half-way” |1). The Hadamard matrix can place a qubit 
in such a state. 


m 


1 

1 


- 


1 

72 

72 


1 


72 

1 

1 


0 


1 

L 72 

72 J 



L 72 J 


| 0 ) + | 1 ) 
72 


The obvious (and not necessarily correct) state to put the bottom input is as state 
|0). And so we have: 



fr fr fr 

Wo) Wi) W2) 


In terms of matrices this corresponds to 

t/X-ff® J)(|0>®|0». 
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We shall carefully examine the states of the system at every point. The system 
starts in 


M = (|0)®|0» = |0,0>. 

We then apply the Hadamard matrix only to the top input — leaving the bottom 
input alone — to get 


Wi) 


no) + ii)i 

. 72 . 


10 ) 


After multiplying with Uf we have 


| 0 , 0 ) + | 1 , 0 ) 

72 


1^2) 


|0,/(0)) + |l,/(l)) 

72 


For the function 0 i—> 1 and 1 i—s- 0 the state |^ 2 ) would be 


1^2) 


00 01 10 11 


' 0 

1 

1 

0 

0 

0 

0 ' 

0 

00 

01 

r 1 1 
72 

0 

00 

01 

1_ 

0 

0 

1 

0 

10 

1 

10 


0 

0 

0 

1 

11 

72 

0 

11 

^0 

_1 


| 0 , 1 ) + | 1 , 0 ) 

72 


If we measure the top qubit, there will be a 50-50 chance of finding it in state |0) 
and a 50-50 chance of finding it in state |1). Similarly there is no real information 
to be gotten by measuring the bottom qubit. So the obvious algorithm does not 
work. We need another trick. 


4.2. Second Attempt. Let us take another stab at solving our problem. Rather 
then leaving the bottom qubit in state 10), let us put it in the superposition state: 


| 0 )-| 1 ) 

72 


72 

72 J 


Notice the minus sign. Even though there is a negation, this state is also “half-way” 
in state |0) and “half-way” in state |1). The change of phase will help us get our 
desired results. We can get to this superposition of states by multiplying state |1) 
with the Hadamard matrix. We shall leave the top qubit as an ambiguous \x). 
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tr fr it 

7o) 7i) W2) 


In terms of matrices, this becomes: 

Uf(I <g) H)\x, 1). 

Let us look carefully at how the states of the qubits change. 

7o) = 7,D- 

After the Hadamard matrix, we have 


7i) = 17 

Applying Uf we get 

W 2 ) = 17 


-|1) 


\ x > 0) - \x, 1) 


72 

10 0/(7) - |i 0/(7) 
72 


72 


= 17 


1/(7) -1/(7) 
72 


where f(x) means the opposite of /( x). And so we have 


17 

[10)-ID 

L 72 


17 

[ID-10) 

L 72 


if /(7 = 0 


if /(7 = 1 


Remembering that a — b = (—1 )(b — a) we might write this as 


I ¥> 2 ) 


(-l) /(x) |7 


No) -11>] 
. 72 . 


What would happen if we evaluate either the top or the bottom state? Again, 
this does not really help us. We do not gain any information. The top qubit will 
be in state |7 and the bottom qubit will — with equal probability — be either in 
state |0) or in state |1). We need something more. 


4.3. Deutsch’s Algorithm. Now let us combine both of these attempts to actu¬ 
ally give Deutsch’s algorithm. Deutsch’s algorithm works by putting both the top 
and bottom qubits into a superposition. We will also put the results of the top 
qubit through a Hadamard matrix. 
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ft 

I Vo) 


ft 

Wi) 


ft 

l</?2> 


ft 

1 ^ 3 ) 


In terms of matrices this becomes: 

(7 ® H)Uf(H <g> 77)|0,1) 

At each point of the algorithm the states are as follows. 


bo) = |0,l). 




00 

[+ 2 I 

|0) + |1)1 

rio)-ii)i 

+ |0,0)-|0,1) + |1,0)-|1,1) _ 01 

1 

— 2 

V2 \ 

. V 2 . 

2 10 

+2 



11 

1 

L~2 J 


We saw from our last attempt at solving this problem, that when we put the 
bottom qubit into a superposition and then multiply by Uf we will be in a super¬ 
position 


(-l) /(x) l*> 


|Q)-|i) ~ 

V2 \ 


Now with \x) in a superposition, we have 


|W2> 


'(_l)/ (o) | 0 ) + (-l)/ (1) |l)' 

rio)-ii)i 

V2 

l V2 \ 


Let us carefully look at 

(-l) /(o) |0) + (-l) /(1)|1> . 

If / is constant this becomes either 


+1(|0} + |1» or -1(|0) + |1)) 

(depending on being constantly 0 or constantly 1.) If f is balanced it becomes 
either 


+1(|0>-|1» or - 1(|0) - |1» 
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(depending on which way it is balanced.) Summing up, we have that 


I+ 2 ) = { 


(± 1 ) 


(± 1 ) 


no)+ iDi 

[|0)-|1)1 

[ 72 J 

[ 72 J 

[10) -ID] 

r 10 ) - 11)1 

72 

72 


if / is constant 


if / is balanced 


Remembering that the Hadamard matrix is its own inverse and takes 


to |0) and takes 
to get 


| 0 ) + | 1 ) 
V2 

^ to 11), we apply the Hadamard matrix to the top qubit 

v 2 


I+3) = < 


(± 1 )| 0 > 


| 0 )-| 1 ) 

72 


if / is constant 


(±1)|1) 


if / is balanced 


\/2 

Now we simply measure the top qubit. If it is in state |0), then we know that / is 
a constant function, otherwise it is a balanced function. And we did this all with 
only one evaluation as opposed to the two evaluations that a classical algorithm 
demands. 

Notice that although the ±1 tells us even more information, namely which of the 
two balanced functions or which of the two constant functions, we are not privy to 
this information. Upon measuring, if the function is balanced, we will measure |1) 
regardless if the state was (—1)11) or (+1)|1). 

In conclusion, we have solved a problem that a classical computer would require 
two function evaluations. Deutsch’s quantum algorithm solved the same problem 
with one function evaluation. Other quantum algorithms are built-up with similar 
ideas to the ones presented here. 
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